Interpreting Dynamic Scenes by a Physics Engine and Bottom-Up Visual Cues

Authors

  • Ilker Yildirim
  • Jiajun Wu
  • Yilun Du
  • Joshua B. Tenenbaum
Abstract

Humans have the remarkable ability to infer many different physical properties of objects from observing dynamic scenes. Here, we propose a generative model for inferring physical object properties from real-world videos of objects splashing in water. At the core of our generative model is a 3D physics engine that models the object's interactions with the environment based on parameters such as density, surface area, shape, and volume. We infer these properties by combining bottom-up visual cues with an MCMC algorithm, which drives the physical simulation to match real-world observations. Results show that our model makes accurate estimates of unobserved physical properties such as mass and density.
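
The inference loop described in the abstract can be illustrated with a toy sketch. This is not the authors' model: `simulate_depths` is a hypothetical one-dimensional stand-in for a 3D physics engine, the volume, drag coefficient, noise level, and prior bounds are assumed values chosen for illustration, and the observations are depth measurements rather than video. The `init` argument only loosely plays the role that a bottom-up visual estimate might play as a starting point. The sampler is a plain random-walk Metropolis-Hastings chain over a single parameter, density.

```python
import math
import random

WATER_DENSITY = 1000.0  # kg/m^3
G = 9.81                # m/s^2


def simulate_depths(density, volume=1e-3, drag=2.0, steps=50, dt=0.02):
    """Hypothetical toy simulator: depth over time of an object released in water.

    Assumed forces: weight, buoyancy, and linear drag (explicit Euler steps).
    """
    mass = density * volume
    depth, velocity = 0.0, 0.0
    depths = []
    for _ in range(steps):
        # Net downward force: weight minus buoyancy minus drag.
        force = (mass - WATER_DENSITY * volume) * G - drag * velocity
        velocity += (force / mass) * dt
        depth += velocity * dt
        depths.append(depth)
    return depths


def log_likelihood(density, observed, noise=0.01):
    """Gaussian log-likelihood (up to a constant) of observed depths."""
    simulated = simulate_depths(density)
    squared_error = sum((s - o) ** 2 for s, o in zip(simulated, observed))
    return -squared_error / (2.0 * noise ** 2)


def infer_density(observed, init=1500.0, n_iter=2000, step=25.0, seed=0):
    """Random-walk Metropolis-Hastings over density with an assumed uniform prior."""
    rng = random.Random(seed)
    current = init
    current_ll = log_likelihood(current, observed)
    samples = []
    for _ in range(n_iter):
        proposal = current + rng.gauss(0.0, step)
        # Assumed uniform prior on (200, 20000) kg/m^3; reject anything outside.
        if 200.0 < proposal < 20000.0:
            proposal_ll = log_likelihood(proposal, observed)
            accept_prob = math.exp(min(0.0, proposal_ll - current_ll))
            if rng.random() < accept_prob:
                current, current_ll = proposal, proposal_ll
        samples.append(current)
    kept = samples[n_iter // 2:]  # discard the first half as burn-in
    return sum(kept) / len(kept)
```

A usage sketch under the same assumptions: generate noisy depths with `simulate_depths(2700.0)`, pass them to `infer_density`, and compare the returned posterior mean with the density used to generate the data. Because the toy simulator makes depth increase monotonically with density, the posterior has a single mode and a simple random-walk proposal is enough here; a richer simulator with several coupled parameters would likely need a more careful proposal scheme.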


Related resources

Visual causes versus correlates of attentional selection in dynamic scenes

What are the visual causes, rather than mere correlates, of attentional selection and how do they compare to each other during natural vision? To address these questions, we first strung together semantically unrelated dynamic scenes into MTV-style video clips, and performed eye tracking experiments with human observers. We then quantified predictions of saccade target selection based on seven ...


Learning to See Physics via Visual De-animation

We introduce a paradigm for understanding physical scenes without human annotations. At the core of our system is a physical world representation that is first recovered by a perception module and then utilized by physics and graphics engines. During training, the perception module and the generative models learn by visual de-animation — interpreting and reconstructing the visual information st...


Impact of dynamic bottom-up features and top-down control on the visual exploration of moving real-world scenes in hemispatial neglect.

Patients with hemispatial neglect are severely impaired in orienting their attention to contralesional hemispace. Although motion is one of the strongest attentional cues in humans, it is still unknown how neglect patients visually explore their moving real-world environment. We therefore recorded eye movements at bedside in 19 patients with hemispatial neglect following acute right hemisphere ...


A Reactive Vision System: Active-Dynamic Saliency

We develop an architecture for reactive visual analysis of dynamic scenes. We specify a minimal set of system features based upon biological observations. We implement these features on a processing network based around an active stereo vision mechanism. Active rectification and mosaicing allow static stereo algorithms to operate on the active platform. Foveal zero disparity operations permit attende...


Police Interpreting: A View from the Australian Context

In the global village of today, more people have been moving and migrating than ever before, creating a need for better communication. Thus community interpreting rose as a specialization serving the needs of members of the community who are unable to communicate with the system. Within this broad field of interpreting, the specialist area of legal interpreting assumed a high position. However, l...




Publication date: 2016